Full-duplex Speech-to-text System for Estonian

نویسنده

  • Tanel Alumäe
چکیده

The paper describes a distributed online speech-to-text system. The main features of the system are real-time speech recognition and full-duplex user experience, meaning that the partially recognized utterance is progressively displayed to the user during speaking. Other benefits include easy client-server communication protocol and system scalability to many concurrent user sessions. The paper also describes two Estonian speech-to-text applications based on the developed framework: a general-domain dictation application with an estimated word error rate of 26.4% and a radiology report dictation system with a word error rate of 13.7%. The system is open-source and based on free software.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Text-to-speech synthesis of estonian

The aim of this text-to-speech synthesis system is to convert the Estonian orthographic text to an orthoepically correct and natural-sounding spoken text for a wide range of practical application (especially for visual and speech disabled persons).

متن کامل

Transcription System for Semi-Spontaneous Estonian Speech

This paper describes a speech-to-text system for semi-spontaneous Estonian speech. The system is trained on about 100 hours of manually transcribed speech and a 300Mword text corpus. Compound words are split before building the language model and reconstructed from recognizer output using a hidden event Ngram model. We use a three pass transcription strategy with unsupervised speaker adaptation...

متن کامل

Designing a Speech Corpus for Estonian Unit Selection Synthesis

The article reports the development of a speech corpus for Estonian text-to-speech synthesis based on unit selection. Introduced are the principles of the corpus as well as the procedure of its creation, from text compilation to corpus analysis and text recording. Also described are the choices made in the process of producing a text of 400 sentences, the relevant lexical and morphological pref...

متن کامل

A Full-Duplex, Dual-Polarization 10Gbps Radio over Fiber system with wavelength reuse for upstream signal

This study presents a full-duplex Radio-over-Fiber (RoF) system providing the users' wireless access with a bit rate of 10 Gbps over 40 GHz radio carrier. This system can be used in a centralized radio access network (C-RAN) architecture because we provide a fully analog front haul link between central station and base station. We can consider it as infrastructure between remote radio heads (RR...

متن کامل

Modelling Speech Temporal Structure for Estonian Text-to-speech Synthesis: Feature Selection

The article discusses the principles of selecting features for modelling the temporal structure of Estonian speech, using different types of read-out texts, with a view to text-tospeech synthesis (TTS). Feature selection is known to depend on certain general issues regulating speech temporal structure, as well as on some language specific aspects. The durational model of Estonian stands out for...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014